#q-learning residual